Region-Based Incremental Pruning for POMDPs
نویسندگان
چکیده
We present a major improvement to the incremental pruning algorithm for solving partially observable Markov decision processes. Our technique targets the cross-sum step of the dynamic programming (DP) update, a key source of complexity in POMDP algorithms. Instead of reasoning about the whole belief space when pruning the cross-sums, our algorithm divides the belief space into smaller regions and performs independent pruning in each region. We evaluate the benefits of the new technique both analytically and experimentally, and show that it produces very significant performance gains. The results contribute to the scalability of POMDP algorithms to domains that cannot be handled by the best existing algorithms.
منابع مشابه
Tree-based pruning for multiagent POMDPs with delayed communication
Multiagent POMDPs provide a powerful framework for optimal decision making under the assumption of instantaneous communication. We focus on a delayed communication setting (MPOMDP-DC), in which broadcast information is delayed by at most one time step. Such an assumption is in fact more appropriate for applications in which response time is critical. However, naive application of incremental pr...
متن کاملIncremental Pruning : A Simple , Fast , Exact Method for PartiallyObservable Markov Decision
Most exact algorithms for general partially observable Markov decision processes (pomdps) use a form of dynamic programming in which a piecewise-linear and convex representation of one value function is transformed into another. We examine variations of the \incremental pruning" method for solving this problem and compare them to earlier algorithms from theoretical and empirical perspectives. W...
متن کاملAccelerated Vector Pruning for Optimal POMDP Solvers
Partially Observable Markov Decision Processes (POMDPs) are powerful models for planning under uncertainty in partially observable domains. However, computing optimal solutions for POMDPs is challenging because of the high computational requirements of POMDP solution algorithms. Several algorithms use a subroutine to prune dominated vectors in value functions, which requires a large number of l...
متن کاملIncremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes
Most exact algorithms for general par tially observable Markov decision processes (POMDPs) use a form of dynamic program ming in which a piecewise-linear and con vex representation of one value function is transformed into another. We examine vari ations of the "incremental pruning" method for solving this problem and compare them to earlier algorithms from theoretical and em pirical persp...
متن کاملRegion-Based Dynamic Programming for POMDPs
We present a major improvement to the dynamic programming (DP) algorithm for solving partially observable Markov decision processes (POMDPs). Our technique first targets the cross-sum pruning step of the DP update, a key source of complexity in POMDP algorithms. Unlike previous approaches, which reason about the whole belief space, the algorithms we present divide the belief space into smaller ...
متن کامل